3574 results found.
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
DFKI Research License
Size:
16.6MB Production Status:
Newly created-in progress
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
not yet
Written
Corpus,
Language Type:
Multilingual
Languages:
English Hindi
Availability:
Freely Available
License:
CreativeCommons
Size:
1.49 million parallel segments <Not Specified>Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
http://www.cfilt.iitb.ac.in/iitb_parallel/
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
50,000 sentences Production Status:
Existing-used
Use:
Sentence boundary detection
Paper:
N/A
Documentation:
<Not Specified>
Speech
Evaluation Data,
Language Type:
Trilingual
Languages:
Basque Catalan English
Availability:
From Owner
License:
hours
Size:
125 Production Status:
Newly created-finished
Use:
Language Identification
Paper:
N/A
Documentation:
<Not Specified>
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch English
Availability:
Freely Available
License:
<Not Specified>
Size:
2 million sentences Production Status:
Existing-used
Use:
Corpus Creation/Annotation
Paper:
N/A
Documentation:
<Not Specified>
Written
Tool: discourse coherence model,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Newly created-finished
Use:
Discourse
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CC BY-SA 3.0 US
Size:
351 MByte Production Status:
Newly created-finished
Use:
Discourse
Paper:
N/A
Documentation:
English documentation available in the README file that comes with the dataset.
Written
Tokenizer,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Apache 2.0
Size:
<Not Specified> Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
Yes, in English, publicly available at http://www.nltk.org/Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
50914 words Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>




